Measuring the dynamic encoding of speaker identity and dialect in prosodic parameters

نویسندگان

  • Michael Barlow
  • Michael Wagner
چکیده

This paper describes a methodology, and the results stemming from it, for analysing the dynamic encoding of speaker identity and dialect in prosodic parameters. A method based on employing properties of the well known Dynamic Time Warping (DTW) algorithm’s path of best match allows the separation of purely dynamic from static properties of acoustic parameters and hence their evaluation as to dynamic encoding of speaker characteristics. Nineteen adult speakers of Australian English were recorded uttering a set of four sentences on five separate occasions over a period of at least one week. The prosodic parameters F0, shorttime energy, zero crossing rate and voicing were extracted for all data and analysed as to their dynamic encoding of speaker identity and dialect. Discriminate analysis (for speaker identity) and correlation analysis (for speaker dialect) analysis showed higher dynamic encoding of identity (75%) and dialect (0.58) than static encoding (55% and 0.45 respectively). Normalisation of all parameters into the range 0—1 reduced discriminate and correlation scores to 70% and 0.54 respectively. Contrasting the warp path parameters with the more conventionally employed DTW distance showed that the warp path parameters better measured speaker identity (72% versus 54%) and speaker dialect (0.56 versus 0.31) encoding. Individual analysis of the prosodic parameters shows a far higher encoding of identity and dialect in F0, though all four parameters encode dialect and identity.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Variation Adds to Prosodic Typology

Variation has not been a major concern of prosodic typologists. Frequently, it is treated as noise in the data and held to conceal what is really important about the prosodic structure of the language. Consequently, most investigations are restricted to a single standard variety and cross-speaker variation is ignored or masked by statistical processing. The results are often assumed to be repre...

متن کامل

Text independent speaker recognition using micro-prosody

The acoustic aspects that differentiate voices are difficult to separate from signal traits that reflect the identity of the sounds. There are two sources of variation among speakers: (1) differences in vocal cords and vocal tract shape, and (2) differences in speaking style. The latter includes variations in both target vocal tract positions for phonemes and dynamic aspects of speech, such as ...

متن کامل

Using prosody and phonotactics in Arabic dialect identification

While Modern Standard Arabic is the formal spoken and written language of the Arab world, dialects are the major communication mode for everyday life; identifying a speaker’s dialect is thus critical to speech processing tasks such as automatic speech recognition, as well as speaker identification. We examine the role of prosodic features (intonation and rhythm) across four Arabic dialects: Gul...

متن کامل

The role of prosody in dialect authentication: Simulating Masan dialect with Seoul speech segments

The purpose of this paper is to examine the viability of simulating one dialect with the speech segments of another dialect through prosody cloning. The hypothesis is that, among Korean regional dialects, it is not the segmental differences but the prosodic differences that play a major role in authentic dialect perception. This work intends to support the hypothesis by simulating Masan dialect...

متن کامل

The Discursive Construction of “Native” and “Non-Native” ‎Speaker English Teacher Identities in Japan: A Linguistic ‎Ethnographic Investigation

Recent poststructuralist theories of identity posit identities as being discursively constructed in interactions with society, institutions, and individuals. This study used a Linguistic Ethnographic framework to investigate the discursive identity construction of two English teachers, one ‘non-native’ English speaker, and one ‘native’ English speaker, teaching English in a tertiary institution...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998